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DETECTION OF CYP3A4 AND CYP2C9 POLYMORPHISMS 

The present invention is directed to methods of preparing biological samples for 
nucleic acid analysis using oligonucleotide primers suitable for amplification of the genes 
encoding the drug-metabolizing cytochrome P450 enzymes CYP3A4 and CYP2C19. 

BACKGROUND OF THE INVENTION 

Xenobiotics are pharmacologically, endocrinological^, or toxicologically active 
substances foreign to a biological system. Most xenobiotics, including pharmaceutical 
agents, are metabolized through two successive reactions. Phase I reactions 
(functionalization reactions), include oxidation, reduction, and hydrolysis, in which a 
derivatizable group is added to the original molecule. Functionalization prepares the 
drug for further metabolism in phase II reactions. During phase II reactions (conjugative 
reactions, which include glucoronidation, sulfation, methylation and acetylation), the 
functionalized drug is conjugated with a hydrophilic group. The resulting hydrophilic 
compounds are inactive and excreted in bile or urine. Thus, metabolism can result in 
detoxification and excretion of the active substance. Alternatively, an inert xenobiotic 
may be metabolized to an active compound. For example, a pro-drug may be converted 
to a biologically active therapeutic or toxin. 

The cytochrome P450 (C YP) enzymes are involved in the metabolism of many 
different xenobiotics. CYPs are a superfamily of heme-containing enzymes, found in 
eukaryotes (both plants and animals) and prokaryotes, and are responsible for Phase I 
reactions in the metabolic process. In total, over 500 genes belonging to the CYP 
superfamily have been described and divided into subfamilies, CYP1-CYP27. In 
humans, more than 35 genes and 7 pseudogenes have been identified. Members of three 
CYP gene families, CYP1, CYP2, and CYP3, are responsible for the majority of drug 
metabolism. The human CYPs which are of greatest clinical relevance for the 
metabolism of drugs and other xenobiotics are CYP1A2, CYP2A6, CYP2C9, CYP2C19, 
CYP2D6, CYP2E1 and CYP3A4. The liver is the major site of activity of these 
enzymes, however CYPs are also expressed in other tissues. 
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The most important drag-metabolizing CYP enzyme is CYP3 A4, which is the 
major CYP expressed in liver. Expression of the gene encoding CYP3A4 (CYP3A4) is 
inducible by many commonly used drugs, such as dexamethasone, rifampicin, and 
clotrimazole. CYP3A4 is estimated to metabolize more than 60% of all drugs in clinical 
5 use, including calcium channel blockers such as nifedipine, immunosuppressants such as 
cyclosporin A, macrolide antibiotics such as erythromycin, and steroid hormones. In 
addition, CYP3A4 metabolizes some carcinogens, and may be implicated in an 
individual's susceptibility to such toxins. 

The existence of more than one form of the CYP3A4 enzyme is caused by 
10 polymorphisms in the gene which encodes the CYP3A4 enzyme (the gene being denoted 
in italics, as CYP3A4). In fact, almost 20 polymorphisms in the CYP3A4 gene have been 
q described (see http;//www.imm.ki.se/cypalleles/ for listing). The distribution of 
^ particular CYP3A4 polymorphisms differs among ethnic groups, however, concomitant 
*Pjj differences in CYP3 A4 activity and responses to drugs which are CYP3A4 substrates 

hi 

11 15 remain to be investigated. CFP3A4*iA is the wild type gene, corresponding to the cDNA 

f!l having GenBank Accession No. A18907 and the genomic DNA having GenBank 

if! 

s Accession No. AF280107. A number of mutations in the 5' untranslated region of 

jj CYP3A4 have been described. CYP3A4HB is an A to G substitution at position -392. 

W CYP3A4HC is a T to G substitution at position -444. CYP3A4HD is a C to A 

l3 

p 20 substitution at position -62. CYP3A4*1E is a T to A substitution at position -369. 

'* CYP3A4*1F is a C to G substitution at -747. The 5' flanking region of CYP3A4 is set 
forth in SEQ ID NO:l and in Figure 1. 

WO 01/20025 discloses single nucleotide polymorphisms in various exons, 
introns, and in the 3' UTR of CYP3A4, as well as oligonucleotides for use in diagnosing 
25 and treating abnormal expression and/or function of this gene. WO 00/24926 discloses 
oligonucleotides for use in detecting an A to G point mutation at position -290 of 
CYP3A4. WO 99/13106 discloses polymorphisms in CYP3A4, including an A to G 
substitution at position -392 of the promoter, at the 7 th position of the 10 bp NFSE, within 
oligonucleotides having sequences ACAAGGGCAAGAGAGAGGC (SEQ ID NO:2) 
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and ACAAGGGCAGGAGAGAGGC (SEQ ID NO:3), with polymorphic variants 
indicated in bold type. 

U.S.Pat.No. 6,174,684 and corresponding WO 00/09752 disclose an A to G 
variant in the nifedipine-specific regulatory element located at positions -287 to -296 of 
5 CYP3A4, which is associated with increased risk of prostate cancer and with increased 
risk of developing leukemia after administration of an epipodophyllotoxin. U.S.Pat.No. 
6,174,684 also discloses the oligonucleotides AGGGCAAGAG (SEQ ID NO:4) and 
AGGGCAGGAG (SEQ ID NO:5), with polymorphic variants indicated in bold type. 
Rebbeck, et al. (1998) J. Natl Cancer Inst 90, 1225-1229 also describes this association 
10 between prostate cancer, leukemia, and the A to G mutation. 

Kuehl, et ah (2001) Nature Genetics 27, 383-391 discloses mutations at positions 
m -341, -288, and -43 of the CYP3A4 promoter, none of which were associated with altered 

"5 CYP3A4 activity. Kuehl, et al also discloses differential distribution of these 
W polymorphisms among Caucasians and African Americans. 

15 A second important CYP enzyme is CYP2C9, which is active in hydroxylation of 

y such drugs as tolbutamide, phenytoin, S-warfarin, diclofenac, ibuprofen, and losarten. 

0 The sequence of CYP2C9 is set forth in SEQ ID NO:6. Six variants in CYP2C9 are 

ill 

1 j described on the CYP web site, and another six variant designations are listed without 
W descriptions. The CYP2C9*! variant is designated as the wild type. Four of the five 
H 20 polymorphic CYP2C9 forms described contain mutations in the coding regions of the 

gene that result in decreased in vitro activity, and the remaining variant, CYP2C9*6, is a 
deletion of an A at position 818 which results in a frame shift. 

WO00/12757 discloses primer extension assays and kits for detection of the 
single nucleotide polymorphisms CYP2C9*2 and CYP2C9*3, both of which result in 

25 amino acid substitutions. 

On the basis of ability of metabolize a marker drug such as nifedipine for 
CYP3A4 or S-warfarin for CYP2C9, individuals may be characterized as poor 
metabolizers (PM), intermediate metabolizers (IM), extensive metabolizers (EM) or ultra 
extensive metabolizers (UEM or UM) for CYP3A4 or CYP2C9 substrates, respectively. 

30 Poor metabolizers retain the substrate in their bodies for a relatively long period of time, 
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and are susceptible to toxicity and side effects at "normal" dosages. Ultraextensive 
metabolizers clear the substrate from their bodies quickly, and require higher than 
"normal" dosages to achieve a therapeutic effect. Intermediate and extensive 
metabolizers retain the substrate in their bodies for times between those of PMs and 
5 UEMs, and are more likely to respond to "normal" dosages of the drug. However, 
individuals characterized as IM or EM may differ in drug clearance by as much as 10- 
fold, and variations in toxicity, side effects, and efficacy for a particular drug may occur 
among these individuals. However, administration of such drugs to determine an 
individual's metabolic capacity may in itself be dangerous, exposing the individual to 
10 potential toxic side effects. 

A need remains for methods of preparing biological samples that contain the 5 1 

■ % n flanking regions of CYP3A4 or CYP2C9, so that this information may be used to predict 

'if differential capacities for metabolizing CYP3 A4 and CYP2C9 substrates among 

W individuals. 

u 15 

if f'f 
$ 

w 
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SUMMARY OF THE INVENTION 

The present inventors have discovered a novel single nucleotide polymorphism in 
the 5' flanking region of CYP3A4, and six novel polymorphisms in the 5' flanking region 
of CYP2C9. Oligonucleotides have been devised for amplification of the polymorphic 
5 regions corresponding to these polymorphisms. These oligonucleotides may be used to 
prepare biological samples for further analysis of the 5' flanking regions of these genes. 
The inventors have also devised sequence determination oligonucleotides for use as 
probes for the novel single nucleotide polymorphisms in CYP3A4 and CYP2C9. 

In one embodiment, the invention provides an oligonucleotide primer pair suitable 
10 for amplifying a polymorphic region of a 5' flanking region of a CYP3A4 gene, wherein 
the polymorphic region corresponds to position 461 of SEQ ID NO:l, which position 
may also be described as position -644 from the transcription start site of the CYP3A4 
gene. 

In another embodiment, the invention provides a sequence determination 
15 oligonucleotide for detecting a polymorphic site in a 5' flanking region of a CYP3A4 
gene, said oligonucleotide being complementary to the polymorphic region 
corresponding to position 461 of SEQ ID NO:l. 

In another embodiment, the invention provides a kit for amplification and/or 
detection of a polymorphic region of the 5' flanking region of a CYP3A4 gene, said kit 
p 20 comprising at least one oligonucleotide primer pair capable of amplifying the region 
corresponding to position 461 of SEQ ID NO:l. 

In another embodiment, the invention provides an oligonucleotide primer pair 
suitable for amplifying a polymorphic region of a 5' flanking region of a CYP2C9 gene, 
wherein the polymorphic region corresponds to position 957 of SEQ ID NO:6; position 
25 1049 of SEQ ID NO:6; position 1 164 of SEQ ID NO;6; position 1526 of SEQ ID NO:6; 
position 1661 of SEQ ID NO:6; and position 1662 of SEQ ID NO:6. Position 957 of 
SEQ ID NO:6 may also be described as position -1 189 from the transcription start site of 
the CYP3C9 gene; position 1049 of SEQ ID NO:6 may also be described as position 
-1097 from the transcription start site; position 1 164 of SEQ ID NO:6 may also be 
30 described as position -982 from the transcription start site; position 1526 of SEQ ID 
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NO:6 may also be described as position -620 from the transcription start site; position 
1661 of SEQ ID NO:6 may also be described as position -485 from the transcription start 
site; and position 1662 of SEQ ID NO:6 may also be described as position -484 from the 
transcription start site. 

5 In yet another embodiment, the invention provides a sequence determination 

oligonucleotide for detecting a polymorphic site in a 5' flanking region of a CYP2C9 
gene, said oligonucleotide comprising a sequence selected from the group consisting of 
an oligonucleotide complementary to the polymorphic region corresponding to position 
957 of SEQ ID NO:6; an oligonucleotide complementary to the polymorphic region 
10 corresponding to position 1049 of SEQ ID NO:6; an oligonucleotide complementary to 
the polymorphic region corresponding to position 1164 of SEQ ID NO:6; an 
Q oligonucleotide complementary to the polymorphic region corresponding to position 
^ 1526 of SEQ ID NO:6; an oligonucleotide complementary to the polymorphic region 
*P corresponding to position 1661 of SEQ ID NO:6; and an oligonucleotide complementary 
|i 15 to the polymorphic region corresponding to position 1662 of SEQ ID NO:6. 

In another embodiment, the invention provides a kit for amplification and/or 
* detection of a polymorphic region corresponding to at least one polymorphic region in 

ifs the 5' flanking region of the CYP2C9 gene, said region being selected from the group 
W consisting of position 957 of SEQ ID NO:6; position 1049 of SEQ ID NO:6; position 

13 20 1 164 of SEQ ID NO:6; position 1526 of SEQ ID NO:6; position 1661 of SEQ ID NO:6; 
and position 1 662 of SEQ ID NO:6. 

BRIEF DESCRIPTION OF THE DRAWINGS 

25 Figure 1 shows the sequence of the 5' flanking region of the CYP3A4 gene as set 

forth in SEQ ID NO: 1, with the novel polymorphic site underlined and highlighted in 
bold. 

Figure 2 shows the sequence of the 5 'flanking region of the CYP2C9 gene as set 
forth in SEQ ID NO:6, with the novel polymorphic sites underlined and highlighted in 
30 bold. 
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DETAILED DESCRIPTION OF THE INVENTION 

The U.S. patents and publications referenced herein are hereby incorporated by 
reference. 

For the purposes of the invention, certain terms are defined as follows. 
5 "Gene" is defined as the genomic sequence of the CYP2C19 gene. 

"Oligonucleotide" means a nucleic acid molecule preferably comprising from 
about 8 to about 50 covalently linked nucleotides. More preferably, an oligonucleotide of 
the invention comprises from about 8 to about 35 nucleotides. Most preferably, an 
oligonucleotide of the invention comprises from about 10 to about 25 nucleotides. In 
10 accordance with the invention, the nucleotides within an oligonucleotide may be analogs 
or derivatives of naturally occurring nucleotides, so long as oligonucleotides containing 
f 3 such analogs or derivatives retain the ability to hybridize specifically within the 
\? polymorphic region containing the targeted polymorphism. Analogs and derivatives of 
45 naturally occurring oligonucleotides within the scope of the present invention are 
J: 15 exemplified in U.S. Pat Nos. 4,469,863; 5,536,821; 5,541,306; 5,637,683; 5,637,684; 
ft 5,700,922; 5,717,083; 5,719,262; 5,739,308; 5,773,601; 5,886,165; 5,929,226; 5,977,296; 

« 6,140,482; WO 00/56746; WO 01/14398, and the like. Methods for synthesizing 

!3 

fi oligonucleotides comprising such analogs or derivatives are disclosed, for example, in the 

W patent publications cited above and in U.S. Pat. Nos. 5,614,622; 5,739,314; 5,955,599; 
S 20 5,962,674; 6,117,992; in WO 00/75372, and the like. The term "oligonucleotides" as 
^ defined herein also includes compounds which comprise the specific oligonucleotides 
disclosed herein, covalently linked to a second moiety. The second moiety may be an 
additional nucleotide sequence, for example, a tail sequence such as a polyadenosine tail 
or an adaptor sequence, for example, the phage M13 universal tail sequence, and the like. 
25 Alternatively, the second moiety may be a non-nucleotidic moiety, for example, a moiety 
which facilitates linkage to a solid support or a label to facilitate detection of the 
oligonucleotide. Such labels include, without limitation, a radioactive label, a fluorescent 
label, a chemiluminescent label, a paramagnetic label, and the like. The second moiety 
may be attached to any position of the specific oligonucleotide, so long as the 
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oligonucleotide retains its ability to hybridize to the polymorphic regions described 
herein. 

A polymorphic region as defined herein is a portion of a genetic locus that is 
characterized by at least one polymorphic site. A genetic locus is a location on a 
chromosome which is associated with a gene, a physical feature, or a phenotypic trait. A 
polymorphic site is a position within a genetic locus at which at least two alternative 
sequences have been observed in a population. A polymorphic region as defined herein 
is said to "correspond to" a polymorphic site, that is, the region may be adjacent to the 
polymorphic site on the 5' side of the site or on the 3' side of the site, or alternatively may 
contain the polymorphic site. A polymorphic region includes both the sense and 
antisense strands of the nucleic acid comprising the polymorphic site, and may have a 
length of from about 100 to about 5000 base pairs. For example, a polymorphic region 
may be all or a portion of a regulatory region such as a promoter, 5' UTR, 3' UTR, an 
intron, an exon, or the like. A polymorphic or allelic variant is a genomic DNA, cDNA, 
mRNA or polypeptide having a nucleotide or amino acid sequence that comprises a 
polymorphism. A polymorphism is a sequence variation observed at a polymorphic site, 
including nucleotide substitutions (single nucleotide polymorphisms or SNPs), insertions, 
deletions, and microsatellites. Polymorphisms may or may not result in detectable 
differences in gene expression, protein structure, or protein function. Preferably, a 
polymorphic region of the present invention has a length of about 1000 base pairs. More 
preferably, a polymorphic region of the invention has a length of about 500 base pairs. 
Most preferably, a polymorphic region of the invention has a length of about 200 base 
pairs. 

A haplotype as defined herein is a representation of the combination of 
polymorphic variants in a defined region within a genetic locus on one of the 
chromosomes in a chromosome pair. A genotype as used herein is a representation of the 
polymorphic variants present at a polymorphic site. 

The PCR primer pairs of the invention are capable of amplifying the polymorphic 
region corresponding to position 461 of SEQ ID NO:l, or any of the polymorphic regions 
corresponding to position 957 of SEQ ID NO:6; position 1049 of SEQ ID NO:6; position 
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1 164 of SEQ ID NO:6; position 1526 of SEQ ID NO:6; position 1661 of SEQ ID NO:6; 
and position 1662 of SEQ ID NO:6. Specific oligonucleotide primer pairs of the 
invention, for amplifying position 461 of SEQ ID NO:l, may comprise sequences 
selected from the group consisting of SEQ ID NO:7 and SEQ ID NO: 8; and SEQ ID 
5 NO;9 and SEQ ID NO: 10. For amplifying only position 957 of SEQ ID NO:6, an 
oligonucleotide primer pair comprising the sequences set forth in SEQ ID NO: 19 and 
SEQ ID NO:20 may be used. Alternatively, positions 957 and 1049 of SEQ ID NO:6 
may be amplified using an oligonucleotide primer pair comprising the sequences set forth 
in SEQ ID NO:21 and SEQ ID NO:22; or positions 957, 1049, and 1164 may be 
10 amplified using an oligonucleotide primer pair comprising the sequences set forth in SEQ 
ID NO:23 and SEQ ID NO:24. Position 1164 of SEQ ID NO:6 may also be amplified 
13 using an oligonucleotide primer pair comprising the sequences set forth in SEQ ID 
V i NO:25 and SEQ ID NO:26. Positions 1526, 1661, and 1662 of SEQ ID NO:6 may be 

fl» amplified using an oligonucleotide primer pair comprising the sequences set forth in SEQ 

i i i 

U 15 ID NO:27 and SEQ ID NO:28. Positions 1661 and 1662 of SEQ ID NO:6 may be 
f 2 amplified using an oligonucleotide primer pair selected from the group consisting of an 
« oligonucleotide primer pair comprising the sequences set forth in SEQ ID NO:29 and 

m SEQ ID NO: 30 and an oligonucleotide primer pair comprising the sequences set forth in- 

W SEQ ID NO:31 and SEQ ID NO:32. 

P 20 Each of the PCR primer pairs of the invention may be used in any PCR method. 

? For example, a PCR primer pair of the invention may be used in the methods disclosed in 

U.S.Pat.Nos. 4,683,195; 4,683,202, 4,965,188; 5,656,493; 5,998,143; 6,140,054; WO 
01/27327; WO 01/27329; and the like. The PCR pairs of the invention may also be used 
in any of the commercially available machines that perform PCR, such as any of the 
25 GeneAmp® Systems available from Applied Biosystems. 

The oligonucleotides of the invention may be used to determine the sequence of 
the polymorphic regions of SEQ ID NO: 1 or SEQ ID NO:6 as defined herein. In one 
embodiment, an oligonucleotide of the invention comprises a sequence selected from the 
group consisting of SEQ ID NO:ll; SEQ ID NO:12; SEQ ID NO:13; SEQ ID NO:14; 
30 SEQ ID NO:15; SEQ ID NO:16; SEQ ID NO:17; and SEQ ID NO:18, for determining 
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the sequence of the novel polymorphic region of CYP3A4 corresponding to position 461 
of SEQ ID NO:l. In another embodiment, for determining the sequence of the 
polymorphic region of CYP2C9 corresponding to position 957 of SEQ ID NO:6, an 
oligonucleotide of the invention comprises a sequence selected from the group consisting 

5 of SEQ ID NO:33; SEQ ID NO:34; SEQ ID NO:43; SEQ ID NO:44; SEQ ID NO:53; 
SEQ ID NO:58; SEQ ID NO:63; and SEQ ID NO:68. In another embodiment, for 
determining the sequence of the polymorphic region of CYP2C9 corresponding to 
position 1049 of SEQ ID NO:6, an oligonucleotide of the invention comprises a sequence 
selected from the group consisting of SEQ ID NO:35; SEQ ID NO:36; SEQ ID NO:45; 

10 SEQ ID NO:46; SEQ ID NO:54; SEQ ID NO:59; SEQ ID NO:64; and SEQ ID NO:69. 
In another embodiment, for determining the sequence of the polymorphic region of 
CYP2C9 corresponding to position 1164 of SEQ ID NO:6, an oligonucleotide of the 
invention comprises a sequence selected from the group consisting of SEQ ID NO:37; 
SEQ ID NO:38; SEQ ID NO:45; SEQ ID NO:48; SEQ ID NO:55; SEQ ID NO:60; SEQ 

15 ID NO:65; and SEQ ID NO:70. In another embodiment, for determining the sequence of 
the polymorphic region of CYP2C9 corresponding to position 1526 of SEQ ID NO:6, an 
oligonucleotide of the invention comprises a sequence selected from the group consisting 
of SEQ ID NO:39; SEQ ID NO:40; SEQ ID NO:49; SEQ ID NO:50; SEQ ID NO:56; 
SEQ ID NO:61; SEQ ID NO:66; and SEQ ID NO:71. In another embodiment, for 

20 determining the sequences of the polymorphic region of CYP2C9 corresponding to either 
of positions 1661 or 1662 of SEQ ID NO:6, an oligonucleotide of the invention 
comprises a sequence selected from the group consisting of SEQ ID NO:41; SEQ ID 
NO:42; SEQ ID NO:51; SEQ ID NO:52; SEQ ID NO:57; SEQ ID NO:62; SEQ ID 
NO:67;andSEQIDNO:72. 

25 Those of ordinary skill will recognize that oligonucleotides complementary to the 

polymorphic regions described herein must be capable of hybridizing to the polymorphic 
regions under conditions of stringency such as those employed in primer extension-based 
sequence determination methods, restriction site analysis, nucleic acid amplification 
methods, ligase-based sequencing methods, methods based on enzymatic detection of 

30 mismatches, microarray-based sequence determination methods, and the like. The 
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oligonucleotides of the invention may be synthesized using known methods and 
machines, such as the ABI™3900 High Throughput DNA Synthesizer and the Expedite™ 
8909 Nucleic Acid Synthesizer, both of which are available from Applied Biosystems 
(Foster City,CA). 

5 The oligonucleotides of the invention may be used, without limitation, as in situ 

hybridization probes or as components of diagnostic assays. Numerous oligonucleotide- 
based diagnostic assays are known. For example, primer extension-based nucleic acid 
sequence detection methods are disclosed in U.S.Pat.Nos. 4,656,127; 4,851,331; 
5,679,524; 5,834,189; 5,876,934; 5,908,755; 5,912,118; 5,976,802; 5,981,186; 6,004,744; 

10 6,013,431; 6,017,702; 6,046,005; 6,087,095; 6,210,891; WO 01/20039; and the like. 
Primer extension-based nucleic acid sequence detection methods using mass 
spectrometry are described in U.S.Pat.Nos. 5,547,835; 5,605,798; 5,691,141; 5,849,542; 
5,869,242; 5,928,906; 6,043,031; 6,194,144, and the like. The oligonucleotides of the 
invention are also suitable for use in ligase-based sequence determination methods such 

15 as those disclosed in U.S.Pat.Nos. 5,679,524 and 5,952,174, WO 01/27326, and the like. 
The oligonucleotides of the invention may be used as probes in sequence determination 
methods based on mismatches, such as the methods described in U.S.PatNos. 5,851,770; 
5,958,692; 6,1 10,684; 6,183,958; and the like. In addition, the oligonucleotides of the 
invention may be used in hybridization-based diagnostic assays such as those described 

20 in U.S.Pat.Nos. 5,891,625; 6,013,499; and the like. 

The oligonucleotides of the invention may also be used as components of a 
diagnostic microarray. Methods of making and using oligonucleotide microarrays 
suitable for diagnostic use are disclosed in U.S.Pat.Nos. 5,492,806; 5,525,464; 5,589,330; 
5,695,940; 5,849,483; 6,018,041; 6,045,996; 6,136,541; 6,142,681; 6,156,501; 6,197,506; 

25 6,223,127; 6,225,625; 6,229,911; 6,239,273; WO 00/52625; WO 01/25485; WO 
01/29259; and the like. 

The invention is also embodied in a kit comprising at least one oligonucleotide 
primer pair of the invention. When the kit is used for amplification and detection of 
CYP3A4 polymorphisms, it will comprise an oligonucleotide primer pair suitable for 

30 amplification of the polymorphic region corresponding to position 461 of SEQ ED NO:l. 
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Specific primer pairs in this embodiment are selected from the group consisting of SEQ 
ID NO:7 and SEQ ID NO:8; and SEQ ID NO;9 and SEQ ID NO: 10. This embodiment of 
the kit of the invention may optionally comprise a sequence determination 
oligonucleotide selected from the group consisting of SEQ ID NO:ll; SEQ ID NO: 12; 
5 SEQ ID NO:13; SEQ ID NO:14; SEQ ID NO:15; SEQ ID NO:16; SEQ ID NO:17; and 
SEQ ID NO: 18. 

When the kit of the invention is used for amplification and detection of 
polymorphisms in the 5' flanking region of CYP2C9, it will comprise at least one 
oligonucleotide primer pair, wherein the primer pair is capable of amplifying a 
10 polymorphic region selected from the group consisting of the polymorphic region 

corresponding to position 957 of SEQ ID NO:6; the polymorphic region corresponding to 
Q position 1049 of SEQ ID NO:6; the polymorphic region corresponding to position 1164 
% of SEQ ID NO:6; the polymorphic region corresponding to position 1526 of SEQ ID 
*5 NO:6; the polymorphic region corresponding to position 1661 of SEQ ID NO:6; and the 
U 15 polymorphic region corresponding to position 1662 of SEQ ID NO:6. This embodiment 

??» may optionally further comprise a sequence determination oligonucleotide for detecting a 

y 3 

s polymorphic variant at any or all of the polymorphic sites corresponding to positions 957, 

U 1049, 1 164, 1526, 1661 and 1662 of SEQ ID NO:6. 

The kit of the invention may also comprise a polymerizing agent, for example, a 
O 20 thermostable nucleic acid polymerase such as those disclosed in U.S.Pat.Nos. 4,889,818; 
^ 6,077,664, and the like. The kit of the invention may also comprise chain elongating 

nucleotides, such as dATP, dTTP, dGTP, dCTP, and dITP, including analogs of dATP, 
dTTP, dGTP, dCTP and dTTP, so long as such analogs are substrates for a thermostable 
nucleic acid polymerase and can be incorporated into a growing nucleic acid chain. The 
25 kit of the invention may also include chain terminating nucleotides such as ddATP, 

ddTTP, ddGTP, ddCTP, and the like. In a preferred embodiment, the kit of the invention 
comprises at least two oligonucleotide primer pairs, a polymerizing agent, chain 
elongating nucleotides, at least two sequence determination oligonucleotides and at least 
one chain terminating nucleotide. The kit of the invention may optionally include 
30 buffers, vials, microtiter plates, and instructions for use. 
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The examples set forth below are provided as illustration and are not intended 
limit the scope and spirit of the invention as specifically embodied therein. 
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EXAMPLE 1 
IDENTIFICATION OF CYP3A4 POLYMORPHISM 

The study was performed in accordance with the principles stated in the 
Declaration of Helsinki as reviewed in Tokyo 1975 and Venice 1983, Hong Kong 1989 
5 and Somerset West 1996. Ten samples (Swedish Caucasians) were selected and used for 
identification of polymorphisms in the 5' flanking region of CYP3A4. 

White blood cells isolated from a blood sample drawn from the brachial vein 
serve as the source of the genomic DNA for the analyses. The DNA was extracted by 
guanidine thiocyanate method or QIAamp Blood Kit (QIAGEN, Venlo, The 
10 Netherlands). The genes included in the study were amplified by PCR and the DNA 
sequences were determined by full sequencing. All genetic analyses were performed 
f 3 according to Good Laboratory Practice and Standard Operating Procedures. Case Report 

% Forms were designed and used for clinical and genetic data collection. Data was entered 

?P and stored in a relational database at Gemini Genomics AB, Uppsala. To secure 

W 

jjU 15 consistency between the Case Report Forms and the database, data was checked either by 

fl double data entry or proofreading. After a Clean File was declared the database was 

s protected against changes. By using the program Stat/Transfer™ the database was 

m transferred to SAS data sets. The S AS™ system was used for tabulations and statistical 

^ evaluations. Genotypes were also correlated against the metabolic ratio. 

6 20 PCR-fragments were amplified with TaqGOLD polymerase (Applied Biosystems) 

pi using Robocycler (Stratagene) or GeneAmp PCR system 9700 (Applied Biosystems). 

Preferentially, the amplified fragments were 300-400 bp, and the region to be read did not 
exceed 300 bp. PCR reactions were carried out according to the basic protocol set forth in 
Table 1, with modifications as indicated in Table 2 for specific primer pairs, which are 
25 shown in Table 3. For the GeneAmp PCR 9700 machine the profile used was 10 minutes at 
95°, 40 x (45 seconds at 90°, 45 seconds at 60°, 45 seconds at 72°), 5 minutes at 72° and 22° 
until removed. 

30 
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Table 1 



Solution 


Stock Concentration 


PCR(^il) 


H 2 0 




33.2 


PCR buffer 


lOx 


5.0 


MgCl 2 


25 mM 


2.0 


dNTP 


2.5 mM 


2.5 


primer 1 


lOuM 


1.0 


primer 2 


lOuM 


1.0 


Taq-gold 
polymerase 


5u/ul 


0.3 


DNA samples 


2ng/ul 


5.0 


TOTAL 




50.0 



Table 2 



SEQID 
NO:s 


Polymorphic 
Site 


Modification from basic protocol (Table 1) 


Detection method 


7,8 


461 


62° annealing temperature 


Full sequencing 


9, 10 


461 


3 \xl MgCl 2 , 58° annealing temperature, 50 
cycles 


Full sequencing 



Table 3 



Polymorphic 
Site 


Primer Pair 


461 


SEQ ID NO:7 CCAGCCTGAAAGTGCAGAGA 
SEQ ID NO:8 TCTTAGAGTCTTTCCTCACCAAACT 


461 


SEQIDNO:9 CATGCCCTGTCTCTCCTTTA 
SEQ ID NO: 10 CCATCCCCTTCATGCAATC 



The optimized condition specified in Table 2 were required to distinguish CYP3A4 
from the closely related gene-family members CYP3A5, and CYP3A7. Use of the basic 
protocol will lead to problems when amplifying CYP3A4-specific amplicons of 300-400 bp 
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containing the polymorphisms of interest, unless a nested PCR approach is carried out. The 
nested PCR approach was not used because of the high risk of contamination when using a 
nested PCR approach and the high risk of typing errors as a consequence. The modifications 
shown in Table 2 were optimized and reaction parameters were balanced in such a way that 
nested PCR was avoided. 

For full sequencing, one of the PCR-primers in a primer pair was designed for 
sequencing by addition of a 29 nucleotide tail complementary to Ml 3 at its 5 '-end, 
namely the nucleotides AGTCACGACGTTGTAAAACGACGGCCAGT. Thus, the 
entire PCR-product was sequenced from the tailed PCR-primer. 

The additional oligonucleotides set forth in Tables 4 through 7 were identified as 
being suitable for detection of the SNP at positions 461 of the 5' flanking region of the 
CYP3A4 gene as depicted in SEQ ID NO:l. 

Table 4 sets forth oligonucleotides representing the coding (sense) strand 
complementary to the polymorphic region corresponding to the novel polymorphism 
found in the study population. The underlined letter indicates polymorphic position in the 
sequence context. All sequences are shown in 5' to 3' direction. 

Table 4 



Polymorphic 
Site 


Sequence 


Note 


461 


SEQ ID NO:ll: AGCACCCTGGT 
SEQ ID NO: 12: AGCACGCTGGT 


C variant 
G variant 



Table 5 sets forth oligonucleotides representing the non-coding (anti-sense) strand 
complementary to the polymorphic region corresponding to the novel polymorphism 
found in the study population. The underlined letter indicates polymorphic position in the 
sequence context. All sequences are shown in 5' to 3' direction. 

Table 5 



Polymorphic 
Site 


Sequence 


Note 


461 


SEQ ID NO: 13: ACCAGGGTGCT 
SEQ ID NO: 14: ACCAGCGTGCT 


Antisense G variant 
Antisense C variant 
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The sequences of Table 6 represent the 5'-sequence to the novel polymorphic site 
on the coding (sense) strand (SEQ ID NO: 15) and non-coding (anti-sense) strand (SEQ 
ID NO:s 16). All sequences are shown in 5' to 3' direction. 



Table 6 



Polymorphic 
Site 


Sequence 


Note 


461 


SEQ ID NO: 15: GTGTGTACAGC 
SEQ ID NO: 16: GCTGT ACACAC 


Sense 5' 
Antisense 5' 



The sequences of Table 7 represent the 3 '-sequence to the novel polymorphic site 
on the non-coding (anti-sense) strand (SEQ ID NO: 17) and the coding (sense) strand 
(SEQ ID NO: 18). All sequences are shown in 5' to 3' direction. 



Table 9 



Polymorphic 
Site 


Sequence 


Note 


461 


SEQ ID NO: 17: TGGTCCCTACC 
SEQ ID NO: 18: GGTAGGGACCA 


Antisense 3' 
Sense 3' 
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EXAMPLE 2 

IDENTIFICATION OF CYP2C9 POLYMORPHISMS 

The study was performed in accordance with the principles stated in the 
Declaration of Helsinki as reviewed in Tokyo 1975 and Venice 1983, Hong Kong 1989 
5 and Somerset West 1996. Ten samples (Swedish Caucasians) were selected and used for 
identification of polymorphisms in the 5' flanking region of CYP2C9. 

White blood cells isolated from a blood sample drawn from the brachial vein 
serve as the source of the genomic DNA for the analyses. The DNA is extracted by 
guanidine thiocyanate method or QIAamp Blood Kit (QIAGEN, Venlo, The 
10 Netherlands). The genes included in the study were amplified by PCR and the DNA 
sequences were determined by full sequencing. All genetic analyses were performed 
Q according to Good Laboratory Practice and Standard Operating Procedures. Case Report 

% Forms were designed and used for clinical and genetic data collection. Data was entered 
*P and stored in a relational database at Gemini Genomics AB, Uppsala. To secure 

U 15 consistency between the Case Report Forms and the database, data was checked either by 
\ t double data entry or proofreading. After a Clean File was declared the database was 

n protected against changes. By using the program Stat/Transfer™ the database was 

m transferred to SAS data sets. The SAS™ system was used for tabulations and statistical 

2 evaluations. Genotypes were also correlated against the metabolic ratio. 

6 20 PCR-fragments were amplified with TaqGOLD polymerase (Applied Biosystems) 

using Robocycler (Stratagene) or GeneAmp PCR system 9700 (Applied Biosystems). 
Preferentially, the amplified fragments were 300-400 bp, and the region to be read did not 
exceed 300 bp. PCR reactions were carried out according to the basic protocol set forth in 
Table 10, with modifications as indicated in Table 11 for specific primer pairs, which are 
25 shown in Table 12. For the GeneAmp PCR 9700 machine the profile used was 10 minutes 
at 95°, 40 x (45 seconds at 90°, 45 seconds at 60°, 45 seconds at 72°), 5 minutes at 72° and 
22° until removed. 
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Table 10 



Solution 


Stock Concentration 


PCR(^il) 


H 2 0 




55.1 


PCR buffer 


10x 




MgCl 2 


25 mM 


2.0 


dNTP 


2.5 mM 


2.5 


primer 1 


lO^M 


1.0 


primer 2 


IOhM 


1.0 


Taq-gold 
polymerase 


5\x/\il 


0.3 


DNA samples 


2ng/\i\ 


5.0 


TOTAL 




50.0 



Table 11 



SEQID 

NO:s 


Polymorphic Site 


Modification from basic protocol (Table 10) 


Detection method 


19,20 


957 


58° annealing temperature 


Full sequencing 


21,22 


957 & 1049 


3 \i\ MgCl 2) 62° annealing temperature 


Full sequencing 


23,24 


957, 1049 & 1164 


58° annealing temperature 


Full sequencing 


25,26 


1164 


3 ill MgCl 2l 62° annealing temperature , 50 
cycles 


Full sequencing 


27,28 


1526, 1661 & 1662 




Full sequencing 


29, 30 


1661 & 1662 


3 [il MgCl 2 , 62° annealing temperature , 50 
cycles 


Full sequencing 


31,32 


1661 & 1662 




Full sequencing 
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Table 12 



Polymorphic 
Site 


Primer Pair 


957 


SEQ ID NO: 19 CACTAGGGAATTTAGAACAAATATG 

ccn m \in-on nr APAr^A A A AOCtAAATTAT 


957 & 1049 


SEQ ID NO:21 TGTATTTAGATCCTCAACTCAGTATGT 


957, 1049 & 1164 


SEQ ID NO:23 GGTCCATTTAGTGATTTCCCTAC 
oun rn xwo/f ATAfA rr apa TTT ATTPTGTTP AT A 


1164 


SEQIDNO:25 CCAAATTTTTCCCTCAGTTACA 
SEQIDNO:26 TTGGTGCCACACAGCTCATA 


1526, 1661 & 1662 


SEQ ID NO:27 GCCTTCAGGAATTTTTTTTA 
SEQIDNO:28 CCAGTTGGGAATATATGATTTAACA 


1661 & 1662 


SEQK)NO:29 GCTGCTGTATTTTTAGTAGGCTATA 
SEQIDNO:30 CGTTCCATTGTCCACTCTGTAC 


1661 & 1662 


SEQ ID NO:3 1 TCAAGGCAGCTCTGGTGTAA 

SEQ ID NO:32 AGTTGGGAATATATGATTTAACAGA 



The optimized condition specified in Tablel 1 were required to distinguish CYP2C9 
from the closely related gene-family members CYP2C8, CYP2C18 and CYP2C19. Use of 
the basic protocol will lead to problems when amplifying C7P2C9-specific amplicons of 
300-400 bp containing the polymorphisms of interest, unless a nested PCR approach is 
carried out. The nested PCR approach was not used because of the high risk of 
contamination when using a nested PCR approach and the high risk of typing errors as a 
consequence. The modifications shown in Table 11 were optimized and reaction parameters 
were balanced in such a way that nested PCR was avoided. 

For full sequencing, one of the PCR-primers in a primer pair was designed for 
sequencing by addition of a 29 nucleotide tail complementary to M13 at its 5'-end, 
namely the nucleotides AGTCACGACGTTGTAAAACGACGGCCAGT. Thus, the 
entire PCR-product was sequenced from the tailed PCR-primer. 

The additional oligonucleotides set forth in Tables 13 through 16 were identified 
as being suitable for detection of the SNPs at positions 957, 1049, 1164, 1526, 1661 
and/or 1662 of the 5* flanking region of the CYP2C9 gene as depicted in SEQ ID NO:6. 
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Table 13 sets forth oligonucleotides representing the coding (sense) strand 
complementary to the polymorphic region corresponding to the polymorphisms found i 
the study population. The underlined letter indicates polymorphic position in the 
sequence context. All sequences are shown in 5' to 3' direction. 

Table 13 



Polymorphic 
Site 


Sequence 


Note 


957 


SEQ ID NO:33: ATCTTCTATTG 
SEQ ID NO:34: ATCTTTTATTG 


C variant 
T variant 


1049 


SEQ ID NO:35: ACAATAGAAAG 
SEQ ID NO:36: ACAATGGAAAG 


A variant 
G variant 


1164 


SEQ ID NO:37: ATGGAGAAGGG 
SEQ ID NO:38: ATGGAAAAGGG 


G variant 
A variant 


1526 


SEQIDNO:39: TTAATGGTAAA 
SEO ID NO:40: TTAATTGTAAA 


G variant 
T variant 


1661 & 1662 


SEQIDNO:41: GGATTTCATTAT 
SEQ ID NO:42: GGATTAAATTAT 


TC variants 
AA variants 



Table 14 sets forth oligonucleotides representing the non-coding (anti-sense) 
strand complementary to the polymorphic region corresponding to the polymorphisms 
found in the study population. The underlined letter indicates polymorphic position in the 
sequence context. All sequences are shown in 5' to 3' direction. 

Table 14 



Polymorphic 
Site 


Sequence 


Note 


957 


SEQ ID NO:43: CAATAGAAGAT 
SEO ID NO:44: CAATAAAAGAT 


Antisense G variant 
Antisense A variant 


1049 


SEQIDNO:45: CTTTCTATTGT 
SEO ID NO:46: CTTTCCATTGT 


Antisense T variant 
Antisense C variant 


1164 


SEQIDNO:47: CCCTTCTCCAT 
SEQ ID NO:48: CCCTTTTCCAT 


Antisense C variant 
Antisense T variant 


1526 


SEQ ID NO:49: TTTACCATTAA 
SEO ID NO:50: TTTACAATTAA 


Antisense C variant 
Antisense A variant 


1661 & 
1662 


SEQ ID NO:5 1 : AT AATGAAATCC 
SEQ ID NO:52: ATAATTTAATCC 


Antisense GA variants 1 
Antisense TT variant | 



The sequences of Table 15 represent the 5 '-sequence to the polymorphic sites on 
the coding (sense) strand (SEQ ID NO:s 53-57) and non-coding (anti-sense) strand (SEQ 
ID NO:s 58-67). All sequences are shown in 5' to 3' direction. 
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Table 15 



Polymorphic 
Site 


Sequence 


Note 1 

Sense 5' 
Antisense 5' 


957 


SEQ ID NO;53: TACCTCCCATC 
SEO ID NO:58: GATGGGAGGTA 


1049 


SEQIDNO:54: AACCAAAAACA 
SEQ ID NO:59: TGTTTTTGGTT 


Sense 5' 
Antisense 5' 


1164 


SEQ ID NO:55: CTGCAGTGATG 
SEOIDNO:60: CATCACTGCAG 


Sense 5" 
Antisense 5' 


1526 


SEQ ID NO:56: TAGGGGGTTTA 
SEQ ID NO:61 : TAAACCCCCTA 


Sense 5* 
Antisense 5' 


1661 & 1662 


SEQ ID NO:57: ATTTGAAAGGA 
SEQ ID NO:62: TCCTTTCAAAT 


Sense 5' 
Antisense 5* 



5 The sequences of Table 16 represent the 3'-sequence to the polymorphic sites on 

the non-coding (anti-sense) strand (SEQ ID NO:s 68-72) and the coding (sense) strand 
(SEQ ID NO:s 73-77). All sequences are shown in 5' to 3' direction. 



Polymorphic 
Site 


Table 16 

Sequence 


Note 


957 


SEQ ID NO:63: TGTGGATGCAA 
SEQIDNO:68: TTGCATCCACA 


Antisense 3' 
Sense 3' 


1049 


SEQ ID NO:64: CATGGCTGCTT 
SEQ ID NO:69: AAGCAGCCATG 


Antisense 3' 
Sense 3' 


1164 


SEQ ID NO:65 : AGGGATCTCCC 
SEQ ID NO:70: GGGAGATCCCT 


Antisense V 
Sense 3' 


1526 


SEQIDNO:66: TAAACACCTTT 
SEQ ID NO:71 : AAAGGTGTTTA 


Antisense 3' 
Sense 3' 


1661 & 1662 


SEQ ID NO:67: TGTTCTTTATA 
SEQ ID NO:72: TATAAAGAACA 


Antisense 3' 
Sense 3' 
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